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ABSTRACT 

Some  heuristic  tests  for  detection  of  existence  of  errors  in 
inspection  are  proposed.  Some  of  these  tests  are  only  effective 
if  the  sampling  function  ((sample  size)/(lot  size))  is  rather  large, 
and  in  all  cases  their  application  predicates  special  experiments  to 
provide  the  requisite  data.  Feasibility  of  these  experiments  will  vary 
according  to  specific  circumstances. 

Estimation  of  the  probability  of  detection  is  discussed. 


1.  INTRODUCTION 


Recent  papers  (Johnson  et  al.)(1980),  Johnson  &  Kotz  (1981),  Kotz  & 
Johnson  (1982)  have  developed  distributions  of  observed  numbers  of  apparently 
defective  items  when  sample  inspection  is  imperfect,  resulting  in 
some  defectives  not  being  observed  as  such,  while  possibly  some  non-defectives 
are  described  as  'defective'  ("false  positives").  Although  these  results 
are  of  interest,  some  more  practical  problems  arise  when  it  is  desired  to 
test  whether  inspection  is^  faulty  or  to  estimate  the  degree  of  imperfection. 

It  is  the  object  of  this  paper  to  discuss  some  aspects  of  these  problems, 
keeping  in  mind  possible  practical  constraints  on  availability  of  data. 

2.  SCOPE  OF  THE  PROBLEM 

It  is  clear  that  detection  of  faulty  inspection  will  usually  call  for 
special  investigation;  the  possible  forms  of  such  investigation  can  be  limited 
by  practical  constraints.  We  will  give  a  few  possible  modes  of  attack,  but 
will  deal  here  with  the  very  simple  case  in  which  random  samples  of  size  n 
are  taken  from  a  lot  of  size  N,  containing  X  (unknown)  defective  items, 
with  constant  probability,  p  for  each  defective  item,  that  it  will  indeed  be 
classified  as  'defective'  on  inspection,  and  with  zero  probability  of  false 
positives.  Even  with  these  simplifying  assumptions,  detection  of  faulty 
inspection  will  often  be  difficult;  and  there  are  clearly  many  possibilities 
of  complication.  For  example  it  may  well  be  that  the  class  'defectives'  is 
not  homogeneous;  some  may  be  more,  and  some  less  easily  detected.  (Sampling 
fpdM  such  stratified  populations  is  discussed  in  Johnson  &  Kotz  (1981).) 

If  sampling  is  with  replacement.  It  will  not  be  possible  to  distinguish 
faulty  from  perfect  Inspection  merely  on  the  basis  of  a  succession  of  values 
Zj,Z2»...  of  the  total  number  (Z)  of  apparently  defective  items  in  routine 
samples  of  size  n.  This  Is  because  In  this  case  each  Z  will  have  a  binomial 


distribution  with  parameters  n,  pX/N  and  it  will  not  be  possible  to  separate 
the  unknowns  p  and  X,  and  so  not  possible  to  test  the  hypothesis  p  =  1.  Some 
discrimination  is  possible  if  sampling  is  without  replacement,  but  this  will 
naturally  be  weak  especially  if  the  sampling  fraction  (f=n/N)  or  the  propor¬ 
tion  (e=X/N)  of  defective  items  is  small.  This  problem  is  discussed  in 
Section  3.  Further  possibilities  of  specially  designed  experiments  are 
described  and  discussed  in  Section  4.  Estimation  of  p,  and  yet  more  possible 
special  designs  or  discussed  in  Section  5. 

3.  AN  EXPERIMENT  TO  TEST  FOR  FAULTY  INSPECTION 

We  will  suppose  Z^,Z2,,,,Zm  (the  number  of  items  declared  defectives  in 
successive  samples)  mutually  independent.  This  would  require  the  return  to 
the  population  of  all  n  items  in  each  sample  before  the  next  sample  is  chosen 
(even  though  for  each  sample,  selection  is  without  replacement)  but  does  not 
call  for  identification  of  previously  chosen  items.  We  will  suppose  this 
done,  but  note  that  it  is  evident  that  it  will  not  always  be  possible  -  for 
example,  if  testing  is  destructive.  In  such  cases  we  would  have  dependence 
among  the  Z's,  arising  from  dependence  among  the  corresponding  numbers  (Y)  of 
actual  defective  items  in  the  samples  -  in  fact 


Pr[  71  (Y.v.JIX] 

j-1  J  J 


N  -  X 


X  -  . ym  N-x-mn  ♦  ^yj.n-yv-.n-y. 


N-mn,n,. ..  ,nj 


where  ( .  .  a  .  \»a!/(Jlb4!).  We  will  not  discuss  this  case  further  here. 

vb0»b-j »...  ,bffl  j  j,n  J 

The  distribution  of  each  Z  is  the  hypergeometric-binomial 


Pr[Z.2|X,p]  -  (J)-'  l  (})(J:J>tJ)l>Z«J-P)y'Z 

(max(0,n-N+X)  <  2  £  min(n,X)) 

p 

(Johnson  et  al.  (1980)).  The  mean  (y)  and  variance  (o  )  of  Z  are 


y  =  np  X/N{=  npy  =  Npfh) 


(2.1) 


=  n{pe(1-  p9)~  p^9(  1-ft) } 

-  Jpr  g=|(1-p)y  [;(1-f)n(1-n'1y)+  f(1-p)|.j  (2.2) 


More  generally  the  r-th  factorial  moment  of  Z  is 


,(Z)  =  E[Z(r)]  =  E[Z(Z-1 ) . . (Z-r+1 )]  =  n(r)X(r)pr/N(r) 


(2.3) 


Formally  our  problem  is  that  of  testing  the  hypothesis  Ho:p  =  1,  with  X 
as  nuisance  parameter.  It  is  clear  that  Hq  cannot  be  tested  on  the  basis  of  a 
single  observed  Z  value.  Even  with  a  set  of  independent  values  Z-j  ,Z2,. .  .Z^, 
(obtainable  if  each  sample  of  m  items  is  returned  to  the  lot  after  inspection) 
construction  of  a  test,  which  is  optimal  in  some  reasonable  way,  presents 
technical  difficulties. 

—  -1  111  2  2 

Since  y  is  estimated  unbiasedly  by  Z  =  m  V  Zi ,  and  o  by  S  = 

_1  m  _ 2  i  =  l  2 

(m-1)  I  (Z^-Z)  ,  it  is  natural  to  seek  for  some  function  of  and  cr  which 

i=l  —22 

depends  only  on  p  (and  not  on  X)  and  then  replace  u  by  Z  and  o  by  S  .  From 


(2.2) 


«  *  aV  +  n’1 (N-l)”1 (N-n)y  =  l-(N-l)-1(n-l )p 


(2.4) 


Is  such  a  function  and  so  we  consider  using 

~~  '  W  =  S?Z  _1+  n"1(N-l)"1(N-n)Z  (3) 

as  a  test  criterion.  The  hypothesis  HQ(p=l)  corresponds  to  ft  s  (N-l)‘^ (N-n) ; 
for  alternative  hypotheses  (p  <  1),  ft  >  (N-l)_1(N-n)  « 


Using  standard  approximate  ("statistical  differentials")  formulae,  we  have 


E[W|X,p]  =  ECS2!'1  |X,p]  +  N'^N-D'^N-nJp  X 

i  °2  o  -  a»tsLllJL£)  ♦  atgifei , +  n-Vir'(N-nh. 

M  cTp  U 

2 

«£  [1  .  -  (^)2}]  +  n'1  (N-l )_1  (N-n)y  (see  Neyman  (1926)) 


*  a  -  m-1o(£)2(/e1~) 

•  \\  I  U 


(4.1) 


Also 


var(W(X,p)  =  var(S2Z-1 | X,p)+2n~ '(N-l )" '(N-n)cov(S^Z -  l ,  Z|X,p) 
+  n'2(N-l)'2(N-n)2var(7|X,p) 

A  a4rvar(S2|X,p)  2cov(S2 ,!| X ,p)  ,  var(Z|X,p)o 
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where  /6-j,  &2  are  the  moment  ratios 


(4.2) 


^  *  o'3E[(Z-u)3!X,p];  Bo  =  o~4E[(Z-)i)4|X.p]  I 

of  the  distribution  (1). 

When  the  null  hypothesis  (p  =  1)  is  valid,  this  distribution  is  hyper- 
geopietric  with  parameters  (n,X,N);  the  appropriate  formulae  for  and 
given  in  Johnson  &  Kotz  (1969,  p.  144).  If  N  is  large,  so  that 
(N-an)/(N+b)  4  1  then 


/81  t  1  h  »  3  +  HTnr0-)[i-«<fO-f)«O-0)i]  (6) 


where  n'  *  n(l-f).  We  also  have 

u  =  n'o 
o2=  n'O(l-e) 


(7.1) 

(7.2) 


Inserting  these  values  in  (4.1)  and  (4.2)  with  p  =  1 ,  we  obtain 

E[W|X,1]  r  *  -°'-eK2LV— f-~-  (8.1) 

2 

var(W|X,l)  •;  +  ^efr-J){f2{f0  +  2o-2)2-2f(l-f)-2n(  1-e) 


-  1 6f  ( 1  -f  )C(l-i.))] 

*  TO  5(,-e) 


(8.2) 


where  g(f,e)  is  a  number  of  magnitude  about  1  (for  example  if  f  =  0  =  ^  , 


9(f.6)  -  ]i\  +  §  -  2)2  -  l  -  l  -  (16  x  tl  )  =  - 


1  ~  2 


16  '  64 


2  =  -1—1 
c  64;- 


4.  OTHER  APPROACHES 


•As  we  have  already  noted  the  most  straightforward  way  of  testing  the 
hypothesis  p  =  1  would  be  by  inspection  of  items  known  to  be  defective.  Standard 
methods  could  then  be  used  -  in  this  case  the  hypothesis  would  be  rejected 
as  soon  as  any  item  was  ‘found1  to  be  nondefective.  The  level  of  significance 
of  the  test  would  be  zero,  and  its  power  function  (!-pni)  where  m  is  the  number 
of  items  to  be  inspected. 

Such  an  inquiry  may  not  be  possible  (for  example  when  determination 
of  defectiveness  entails  destruction)  but  we  may  approach  it  by  repeated 
inspection  of  the  same  item  (not  knowing  whether  it  is  defective  or  not).  If 
any  two  Inspections  give  different  results,  then  we  know  that  p  /  1  (since 
we  are  here  assuming  that  a  nondefective  item  will  always  be  classified 
correctly).  Of  course,  if  the  item  is  really  nondefective  this  test  will  have 

no  discriminating  power,  since  everv  inspection  will  result  in  a  correct 
decision. 
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Soinet itiies  it  may  be  possible  to  repeat  inspection  of  the  same  sample 
(of  si2e  n)  m  times,  but  to  be  able  to  record  only  the  total  numbers 
(Tj • • »Tm)  °f  ^®ms  deemed  'defective'  at  each  inspection.  This  increases 
the  chance  of  including  at  least  one  truly  defective  item  among  those 
Inspected. 

Here,  again,  if  any  two  T's  are  unequal,  then  we  know  that  HQ  is  not 
true  (i.e.  p  <  1)  since  decision  in  regard  to  at  least  one  item  rust  differ 
in  the  two  corresponding  inspections.  If  there  are  really  y  defective  items 
among  the  n  subjected  to  inspection,  the  probability  that  departure  of  p  from 
1  will  be  detected,  as  a  consequence  of  at  least  two  of  the  T's  differing  is 

i  -  f 

t=o  1 

The  overall  probability,  if  the  sample  is  chosen  without  replacement  from  a 
population  of  size  N  containing  just  X  defectives  is 

i  -  e[  y  {(V(i-P)v't  mi 

t=0  1 

where  Y  has  a  hypergeometric  distribution  with  parameters  (n,X,N). 

5 .  ESTIMATION  OF  THE  PROBABILITY  OF  DETECTION  ( p ) 

So  far  we  have  considered  only  testing  for  the  existence  of  errors  in 
inspection  which  lead  to  nondetection  of  defective  items.  If  the  existence  of 
such  errors  is  established,  it  is  natural  to  attempt  to  estimate  p.  With 
data  of  the  kind  used  in  the  test  statistic  W  (Section  3)  a  natural  estimator 
would  be  (in  view  of  (4.1))  (n-l)(N-l)"^(1-W),  possibly  with  a  bias  correction. 
For  reasonable  accuracy  however,  we  need  data  of  the  kind  described  towards 
the  end  of  Section  4. 

If  we  were  in  the  fortunate  position  of  having  a  number  of  items  known  to 
be  defective  we  could  obtain  a  simple  estimator  (of  p)  by  testinq  them  reoeatedly 
and  estimating  p  by  the  proportion  of  times  a  'defective'  decision  is  obtained. 


If  this  is  not  the  case  at  the  beginning  of  the  investigation  we  might, 
however,  be  in  a  position  to  exploit  the  fact  that  an  item  declared  'defective 
at  any  time  must  (according  to  our  assumptions)  be  defective.  Denoting  by 


N.  the  number  of  items  declared  defective  just  j  times  in  m  trials  (so  that 
J 

Nq  +  N-J+...+  Nm  =  n,  and  if  p  =  1,  NQ  =  n-y,  Nm  =  y)  a  plausible  but 
specious  argument  might  run  as  follows: 


"For  each  item,  we  discard  the  first  'defective'  decision  and  observe 

m 

the  proportion  of  defective  decision  in  the  remaining  \  fJj  sets  of 

(m-1)  trials.  Since  these  are  independent,  the  total  number  of 

m 

defectives  in  the  trials  has  a  binomial  ((m-1)  7  N.,p)  distribution, 

j=l  J 

and  our  estimate  of  p  is  unbiased,  with  an  easily  computed  standard 
deviation. " 


(It  is  not  difficult  to  see  that  this  will  produce  a  negatively  biased  esti¬ 
mator  of  p,  because  in  all  the  tirals  which,  are  thrown  away  a  decision  of 
'defective'  is  reached.  In  fact  the  estimation  is 

m 


1 


T  (j-l)M. 

(m-1)  l  N 
j*l  J 

and  its  expected  value  is 

ML 


1 


m-1 


I  JN. 

JH  i 


m 

L  *  Ni 
Lj  =  l  J 


(9) 


{hpt 


l-(l-p) 


m 


-  1 


}•> 


(10) 


We  could  take  notice  of  only  those  trials  following  the  first  'defective' 
decision;  although  this  will  not  use  all  the  information  available,  it  does 
lead  to  simple  formulae.  The  observed  proportion  of  'defective'  decisions  is 
no  an  unbiased  estimator  of  p;  the  (conditional)  distribution  of  the  number 
of  'defective  decision  counted  is  binomial  with  parameters  (total  number  of 
Inspections  Included  in  count,  p). 
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Suppose,  now,  that  we  have  m  repeated  inspections  of  a  set  of  n  items 
(among  which  an  unknown  number  y  are  defective)  and  have  been  able  to  record 
the  results  of  individual  inspections  (and  not  just  total  number  of  decisions 
of  'defective1  for  each  inspection  of  the  set  of  n  items).  The  likelihood 
function  is 


y-n+N, 


,,N, . ,NnJ(l-p)m(y-n+No>  E  {<”)PJ<1-P)m-3}  J 


y-n+Ng  ,N-|  , .  • .  ,N 


m 


in 

F.jN . 


m 


/  m  m  \  ,j  my-dN. 
\li  (?)/  P1  (1-P)  1  J 

j  =  l  J 


(ID 


(n-Nn  s  y  <  n) 


m 


where  Nn  =  n  -  V  N.  is  the  number  of  items  which  are  not  declared  defective 

j  =  l  J  m 

is  any  of  the  m  inspections.  Note  that  (Nn,  >  jN.)  is  a  sufficient 

U-  j=l  J 

statistic  for  (y,p);  7jN  is  the  total  number  of  ’defective'  decisions.  If 


y  were  known,  the  maximum  likelihood  estimator  of  p  would  be 


p(y)  =  (niy)-1  y  jN. 

j=l  J 

The  corresponding  maximized  log  likelihood  would  be 


m 


(12) 


n-Ng+1 


m 


m 


log  £(y)  =  K  +  l  log  (y-i)  +  (my-  )  jN.)log(my-  l  jN.)-my  log  my  (13) 
i=0  j=l  J  j=l  J 

where  K  does  not  depend  on  y.  We  then  seek  to  minimize  (13)  with  respect  to 

y,  subject  to  n  >  y  2  n  -  NQ.  Note  that  we  are  not  primarily  interested  in 

the  value  of  y  itself,  but  we  need  the  value,  y,  maximizing  (13)  to  calculate 

the  maximum  likelihood  estimator,  p(y) ,  of  p. 

m 

A  useful  practical  method  is  obtained  by  noting  that  for  the  l  N.  items 

j=l 

which  we  know  to  be  defective,  the  numbers  of  times  each  item  is  declared 
'defective'  can  be  regarded  as  observed  values  of  independent  random  variables 


» 
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each  having  a  binomial  (m,p)  distribution,  truncated  by  omission  of  zero 
values.  Equating  sample  means  and  expected  values  gives  the  equation 

p{l-(l-p)m}_1  =  m'V-Nj'1  ™  jN.  (14) 

u  j=l  3 


for  an  estimator  of  p.  This  estimator  is,  in  fact,  the  conditional  maximum 

likelihood  estimator  of  p,  given  Ng. 

If  we  do  require  to  get  a  estimator  of  y,  we  note  that,  with 

.  m 

P  ■  P,  E[  I  N.  =  n  -  N0jy,p]  =  y{l-(l-p)}. 
j=1  J  u 

Replacing  expected  by  actual  values,  we  get  the  the  estimator 


y=  min(n,  ("(n-NgJO-Jl-p)111}’^ 


)=  min(n. 


(mp)"1 


m 


) 


(15) 


where  [  ]  denotes  'nearest  integer  to'. 

As  a  numerical  example  suppose  we  test  each  of  50  (=n)  items  three  (=ir.) 
times  and  obtain 


N0  =  43;  N1  *  1;  N2  *  1;  N3  «*5 

so  that 

3 

7  jN.  =  1+2+15  =  18.  Equation  (14)  gives 

j-1  0 

p{3p  -  3p2  +  p3r]  =  (3  7 ) ~ 1 18  =  6/7 


where 

leading  to 
From  (15)  , 


y  =  mi 


p  =  0.8545 
nf50. 


-LE. 


3  *  0.8545 


=  7  . 


Note  that  the  same  values  of  p  and  y  would  be  obtained,  whatever  the  value 


of  n  (2.  7). 
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